Visualizing I/O Predictability
نویسندگان
چکیده
We propose a novel method to study storage system predictability based on the visualization of file successor entropy, a form of conditional entropy drawn from a file access trace. First-order conditional entropy can be used as a measure o f predictability. It is superior to the more common measures such as independent likelihood of data access. For file access data, we developed a visualization tool that produces 3D graphical views of the variation in predictability of successive access events on a per-file basis. Our visualization tool provides interactive observation of the variations in predictability according to some arbitrary criterion, e.g. time of day, program identifier, user groups, or any other classification of files. Four entropy data sets were extracted from various file system traces. These four data sets are representative of the variability in file access patterns for different machine use: server, personal workstation, large number of interactive users, and heavy write activity. Visualization results show that there is strong predictability among files and optimizations would be profitable.
منابع مشابه
Visualizing cache effects on I/O workload predictability
We describe our experience graphically visualizing data access behavior, with a specific emphasis on visualizing the predictability of such accesses and the consistency of these observations at the block level. Such workloads are more frequently encountered after filtering through intervening cache levels and in this paper we demonstrate how such filtered workloads pose a problem for traditiona...
متن کاملVisualizing Time Series Predictability
Predicting how temporally varying phenomena will evolve over time, or in other terms forecasting, is one of the fundamental tasks in time series analysis. Prediction has gained particular importance with the advent of real time data collection activities. Although there exist several sophisticated methodologies to predict time series, the success of a predictive analysis process remains mostly ...
متن کاملVisualizing I/O Performance During the BGL Deployment∗†
Among the many challenges in getting BlueGene/L into production was the initial under-performance of the Lustre-based parallel file system. This report focuses on the role of visualization tools that assisted in understanding and improving performance. The authors were able to bring about a two-fold increase in the delivered I/O performance.
متن کاملVCDC: The Virtualized Complicated Device Controller
I/O virtualization enables time and space multiplexing of I/O devices, by mapping multiple logical I/O devices upon a smaller number of physical devices. However, due to the existence of additional virtualization layers, requesting an I/O from a guest virtual machine requires complicated sequences of operations. This leads to I/O performance losses, and makes precise timing of I/O operations un...
متن کاملReturn Value Predictability Profiles for Self-healing
Current embryonic attempts at software self–healing produce mechanisms that are often oblivious to the semantics of the code they supervise. We believe that, in order to help inform runtime repair strategies, such systems require a more detailed analysis of dynamic application behavior. We describe how to profile an application by analyzing all function calls (including library and system) made...
متن کامل